Abstract

Consider the following stochastic model for immune response. Each pathogen gives birth
to a new pathogen at rate $\lambda$. When a new pathogen is born, it has the same type as its
parent with probability $1 - r$. With probability $r$, a mutation occurs, and the new pathogen
has a different type from all previously observed pathogens. When a new type appears in
the population, it survives for an exponential amount of time with mean 1, independently
of all the other types. All pathogens of that type are killed simultaneously. Schinazi and
Schweinsberg (2006) have shown that this model on $\mathbb{Z}^d$ behaves rather differently from its
non-spatial version. In this paper, we show that this model on a homogeneous tree captures
features from both the non-spatial version and the $\mathbb{Z}^d$ version. We also obtain comparison
results between this model and the basic contact process on general graphs.




1 Introduction 

Schinazi and Schweinsberg (2006) have recently introduced a non-spatial and a spatial version 
of the following stochastic model for immune response. For the non-spatial version (Model 2 in 
their paper), each pathogen gives birth to a new pathogen at rate $\lambda$. When a new pathogen is
born, it has the same type as its parent with probability $1 - r$. With probability $r$, a mutation
occurs, and the new pathogen has a different type from all previously observed pathogens. When 
a new type appears in the population, it survives for an exponential amount of time with mean 
1, independently of all the other types. All pathogens of that type are killed simultaneously. 
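
The dynamics just described can be sketched as a small event-driven simulation. This is only an illustrative sketch, not part of the original paper: the function name `simulate_nonspatial`, the event cap `max_events`, and the seeding are assumptions made here. The state records how many pathogens each type currently has; births occur at total rate $\lambda$ times the population size, and each living type is killed at rate 1.

```python
import random


def simulate_nonspatial(lam, r, max_events=10_000, seed=0):
    """Event-driven sketch of the non-spatial model (hypothetical helper).

    Returns (extinct, time, counts), where counts maps a type label to the
    number of pathogens of that type when the run stopped.
    """
    rng = random.Random(seed)
    counts = {1: 1}      # start with a single pathogen of type 1
    next_type = 2        # label for the next mutant type
    t = 0.0
    for _ in range(max_events):
        population = sum(counts.values())
        birth_rate = lam * population   # each pathogen gives birth at rate lam
        death_rate = len(counts)        # each type is killed at rate 1
        total = birth_rate + death_rate
        t += rng.expovariate(total)
        if rng.random() * total < birth_rate:
            # Parent chosen uniformly among all pathogens.
            parent = rng.choices(list(counts), weights=list(counts.values()))[0]
            if rng.random() < r:
                counts[next_type] = 1   # mutation: a brand-new type
                next_type += 1
            else:
                counts[parent] += 1     # same type as the parent
        else:
            victim = rng.choice(list(counts))
            del counts[victim]          # all pathogens of that type die together
            if not counts:
                return True, t, counts
    return False, t, counts             # event cap reached before extinction
```

A run that hits the event cap says nothing rigorous about survival; the cap only keeps supercritical runs from growing without bound.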

For the spatial version (Model S2 in Schinazi and Schweinsberg (2006)), let $S$ be the square
lattice $\mathbb{Z}^d$ or the homogeneous tree $T_d$ in which every vertex has $d + 1$ neighbors. Let $x$ be a site
in $S$ occupied by a pathogen and $y$ be one of its nearest neighbors. There are $2d$ such neighbors
in $\mathbb{Z}^d$ and $d + 1$ in $T_d$. After a random exponential time with rate $\lambda$, the pathogen on $x$ gives
birth on $y$, provided $y$ is empty (if $y$ is occupied nothing happens). With probability $1 - r$ the
new pathogen on $y$ is of the same type as the parent pathogen on $x$. With probability $r$ the new
pathogen is of a different type. We assume that every new type that appears is different from all 
types that have ever appeared. As in the non-spatial model, we assume that each type survives 
for an exponential amount of time with mean 1, independently of all other types, and that all 
pathogens of a type are killed simultaneously. 

Biological motivation has been provided in Schinazi and Schweinsberg (2006). There is also 
some obvious motivation for such a model for the spread of a virus in computer networks. In 
particular, when a virus is discovered in a computer network it is usually destroyed at once. It is 
also interesting to test the influence of particular network topologies on the spread of a virus; see 
for instance Pastor-Satorras and Vespignani (2001), Berger et al. (2005), Ganesh et al. (2005),
and Draief et al. (2006). 

Schinazi and Schweinsberg (2006) have shown that the pathogens survive in the non-spatial 
model with positive probability if and only if $\lambda > 1$ and $r > 0$; see their Theorem 1.2. In contrast,
for the spatial model on $\mathbb{Z}^d$ they have shown that if $\lambda$ is large enough then the pathogens survive
with positive probability for large $r$ and die out with probability 1 for small $r$; see their Theorem
4. Given that a ball of radius $R$ has of the order of $d^R$ sites in the homogeneous tree $T_d$, one
might conjecture that the behavior of the spatial model on $T_d$ is similar to that of the non-spatial
model (there are so many sites on the tree that space may not be a limitation). As the reader
will see in the next section, this is not so. In fact, the model on the tree captures features of
both the model on $\mathbb{Z}^d$ and the non-spatial model. These results will be stated in Section 2
and proved in Sections 4 and 5. 

The basic contact process may be thought of as being a particular case of this model with 
r = 1. See Liggett (1999) for background and results on the contact process. For r = 1, all 
pathogens are of different types and therefore only one pathogen dies at a time. Hence, one 
might conjecture that the model with r = 1 has a better chance of surviving than any model 
with r < 1. This turns out to be true but requires nontrivial arguments. This will be done in 
Section 3. This comparison result applies to any graph. 



2 Phase transitions 



We say that the pathogens survive on a graph S if there is a positive probability that at all times 
there is at least one pathogen somewhere in S. If the pathogens do not survive they are said to 
die out. 

Theorem 1. Consider the contact process with mutations on the homogeneous tree $T_d$ with $d \ge 2$,
started with a single pathogen of type 1.

1. If $\lambda > \frac{1}{d-1}$ then the pathogens survive for all $r > 0$.

2. If $\lambda \le \frac{1}{d-1+2r}$ then the pathogens die out. In particular, if $\lambda \le \frac{1}{d+1}$, the pathogens die out
for all $r > 0$.

3. If 1 d+ ^(ti~i) I+9d ^ < $\lambda$ < $\frac{1}{d-1}$, the pathogens die out for $r$ close to 0 and survive for $r$ close
to 1.

These results will be proved in Sections 4 and 5.

Note that for all $d \ge 2$, the left-hand side of the inequality in Theorem 1.3 is strictly smaller
than the right-hand side. Therefore, Theorem 1.3 shows that for intermediate values of $\lambda$ there
is a phase transition in $r$. Moreover, the difference between the right and left sides of Theorem
1.3 is asymptotic to $\frac{2}{3d^2}$ as $d \to \infty$. Combining Theorem 1.1 and 1.2, we see that the set of $\lambda$'s
for which there is a phase transition in $r$ is of size asymptotically at most $\frac{2}{d^2}$. So, up to a factor
of three, we have found the correct size of this set. 
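
As a quick numerical companion to parts 1 and 2 of Theorem 1, the following sketch evaluates the two thresholds and the width of the interval of $\lambda$ that they leave undecided. The function names are illustrative choices made here, not notation from the paper.

```python
def survival_threshold(d):
    """Theorem 1.1: survival for every r > 0 once lam exceeds 1/(d-1)."""
    return 1.0 / (d - 1)


def extinction_threshold(d, r):
    """Theorem 1.2: extinction whenever lam <= 1/(d-1+2r)."""
    return 1.0 / (d - 1 + 2 * r)


def transition_window(d):
    """Interval of lam in which a phase transition in r is not ruled out.

    The lower end is the extinction threshold at r = 1, i.e. 1/(d+1).
    """
    return extinction_threshold(d, 1.0), survival_threshold(d)


if __name__ == "__main__":
    for d in (2, 5, 10, 100):
        lo, hi = transition_window(d)
        # The width equals 2/(d^2 - 1), so width * d^2 tends to 2.
        print(d, lo, hi, (hi - lo) * d * d)
```

The width of the window is exactly $2/(d^2-1)$, which is the source of the asymptotic size quoted above.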

We now turn to another type of phase transition. Let $A_t$ be the set of sites that are occupied
by any pathogen. The process is started with a single pathogen at the site $x$. Recall that
pathogens are said to survive if

$$P(A_t \neq \emptyset,\ \forall t > 0) > 0. \qquad (1)$$

The pathogens are said to survive weakly if (1) holds and

$$P(x \notin A_t \text{ for sufficiently large } t) = 1. \qquad (2)$$

That is, with positive probability, there are pathogens somewhere in $S$ for all times, but almost
surely a given site gets infected only finitely many times. If $P(x \in A_t \text{ for arbitrarily large } t) > 0$,
then the process is said to survive strongly.

The interest in interacting particles on trees was sparked by Pemantle (1992). He proved 
that, in contrast to what happens on $\mathbb{Z}^d$, there exists an intermediate range of $\lambda$'s for which the
basic contact process on a homogeneous tree survives weakly; see Liggett (1999) for a complete 
description of the results and people involved. We next give a partial result in the same direction 
for the contact process with mutations. 

Theorem 2. Consider the contact process with mutations on the homogeneous tree $T_d$. If
$\frac{1}{d-1} < \lambda < \frac{1}{2\sqrt{d}}$ then the pathogens survive weakly for all $r > 0$.

This proves that for $d \ge 6$, there is a range of $\lambda$ for which the contact process with mutations
survives weakly. We believe this result to hold for all $d \ge 2$. Theorem 2 will be proved at the
end of Section 3. 
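
The restriction to $d \ge 6$ comes from requiring the interval in Theorem 2 to be non-empty, that is, $1/(d-1) < 1/(2\sqrt{d})$. A minimal sketch that checks this for small $d$ (the helper name `weak_survival_window` is an assumption of this example):

```python
import math


def weak_survival_window(d):
    """Return (lo, hi) with lo = 1/(d-1), hi = 1/(2*sqrt(d)), or None if empty."""
    lo = 1.0 / (d - 1)
    hi = 1.0 / (2.0 * math.sqrt(d))
    return (lo, hi) if lo < hi else None


if __name__ == "__main__":
    for d in range(2, 12):
        print(d, weak_survival_window(d))
```

The condition $d - 1 > 2\sqrt{d}$ is equivalent to $\sqrt{d} > 1 + \sqrt{2}$, so the window first opens at $d = 6$.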

3 Comparison between the contact process with mutations and 
the basic contact process 

Let $S$ be a connected undirected graph. Let $A_t$ be the set of sites occupied by a pathogen at time
$t$ for the contact process with mutations on $S$. A typical state is a finite subset $A$ of $S$, together
with a partition $\{A^i\}$ of $A$; $A^i$ is the set of sites of type $i$. Denote the semigroup and generator of
the process by $S_r(t)$ and $L_r$. Note that the mutation model with $r = 1$ can be viewed as a version
of the basic contact process, but only if all initial types are distinct; otherwise they are not the 
same. To make the comparison with the basic contact process, it is convenient to make it have 
the same birth mechanism as the multitype version. We will carry along the information about 
types, but ignore that information in the death mechanism - each individual dies separately at 
rate 1. We will use primes when referring to this process. Thus the generators are given by 

$$L_r g(A) = \sum_i \left[g(A \setminus A^i) - g(A)\right] + \lambda \sum_i \sum_{x \in A^i,\, y \notin A,\, x \sim y} \left[r g(U_y A) + (1 - r) g(V_y^i A) - g(A)\right]$$

and

$$L'_r g(A) = \sum_{x \in A} \left[g(A \setminus \{x\}) - g(A)\right] + \lambda \sum_i \sum_{x \in A^i,\, y \notin A,\, x \sim y} \left[r g(U_y A) + (1 - r) g(V_y^i A) - g(A)\right],$$

where $U_y A = V_y^i A = A \cup \{y\}$ as sets, but with different partitions: In $U_y A$, $\{y\}$ is added as a
new set in the partition, while in $V_y^i A$, $y$ is added to $A^i$. Here $x \sim y$ means that $x$ and $y$ are
neighbors in $S$.

We begin with a result about the basic contact process, which is an extension of Theorem 6.2 
of Harris (1974). His result is the following statement in the special case $S = \mathbb{Z}^d$ and $C = \mathbb{Z}^d$.
The proof below is an adaptation of the proof of Proposition 5.9 of Chapter III of Liggett (1985).
(As a historical note, we point out that Harris proved his result using coupling rather than duality.
He didn't discover contact process duality until two years later.) Here $P^A$ refers to probabilities
when $A$ is the initial state of the process at time zero.

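Lemma 3. For all $A, B, C \subset S$ and every $t > 0$,

$$P^{A \cap B}(A'_t \cap C \neq \emptyset) + P^{A \cup B}(A'_t \cap C \neq \emptyset) \le P^A(A'_t \cap C \neq \emptyset) + P^B(A'_t \cap C \neq \emptyset).$$

Proof. By checking cases, one sees that

$$1_{\{C \cap (A \cap B) \neq \emptyset\}} + 1_{\{C \cap (A \cup B) \neq \emptyset\}} \le 1_{\{C \cap A \neq \emptyset\}} + 1_{\{C \cap B \neq \emptyset\}}$$

for all $A, B, C$. Replacing $C$ by $A'_t$ and taking expectations gives

$$P^C(A'_t \cap (A \cap B) \neq \emptyset) + P^C(A'_t \cap (A \cup B) \neq \emptyset) \le P^C(A'_t \cap A \neq \emptyset) + P^C(A'_t \cap B \neq \emptyset).$$

Now use duality (Theorem 1.7 of Chapter VI of Liggett (1985)). □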
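
The "checking cases" step in the proof of Lemma 3 is a purely set-theoretic inequality between indicators, so it can be confirmed by brute force on a small ground set. The sketch below does this exhaustively; the helper names and the choice of a four-element universe are assumptions of the example, and the check does not replace the case analysis.

```python
from itertools import combinations


def subsets(universe):
    """Yield every subset of the given finite universe as a frozenset."""
    items = list(universe)
    for size in range(len(items) + 1):
        for combo in combinations(items, size):
            yield frozenset(combo)


def hits(c, a):
    """Indicator that c and a intersect."""
    return 1 if c & a else 0


def check_indicator_inequality(universe):
    """Check 1{C∩(A∩B)≠∅} + 1{C∩(A∪B)≠∅} <= 1{C∩A≠∅} + 1{C∩B≠∅} for all A, B, C."""
    all_sets = list(subsets(universe))
    for a in all_sets:
        for b in all_sets:
            for c in all_sets:
                if hits(c, a & b) + hits(c, a | b) > hits(c, a) + hits(c, b):
                    return False
    return True


if __name__ == "__main__":
    print(check_indicator_inequality(range(4)))
```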

Here is the basic comparison result: 
Theorem 4. For every $A, C \subset S$, $0 \le r \le 1$, and $t > 0$,

$$P^A(A_t \cap C \neq \emptyset) \le P^A(A'_t \cap C \neq \emptyset).$$

Proof. Let $f(A) = 1_{\{A \cap C \neq \emptyset\}}$. The integration by parts formula gives

$$P^A(A'_t \cap C \neq \emptyset) - P^A(A_t \cap C \neq \emptyset) = S'_r(t) f(A) - S_r(t) f(A) = \int_0^t S_r(s)(L'_r - L_r) S'_r(t - s) f(A)\, ds.$$

The easiest way to check this identity is to integrate the derivative with respect to $s$ of
$S_r(s) S'_r(t - s) f$ from 0 to $t$, recalling that while the two semigroups do not commute, each
semigroup commutes with its own generator. By the lemma, $g(A) = S'_r(t - s) f(A)$ (note that
$g(A)$ depends on $A$ only through the set of occupied sites - not on its partition by type) satisfies

$$g(A \cap B) + g(A \cup B) \le g(A) + g(B) \qquad (3)$$

for all $A, B$. Thus it suffices to show that $(L'_r - L_r) g \ge 0$ for all $g$ that depend on $A$ only through
the set of occupied sites and satisfy (3). Of course, $L_r g(A)$ does depend on the types. So, write
$A = \cup_i A^i$, where $\{A^i\}$ is the partition of $A$ according to type. Then since the terms corresponding
to births agree for the two generators,

$$(L'_r - L_r) g(A) = \sum_{x \in A} \left[g(A \setminus \{x\}) - g(A)\right] - \sum_i \left[g(A \setminus A^i) - g(A)\right]
= \sum_i \left\{ \sum_{x \in A^i} \left[g(A \setminus \{x\}) - g(A)\right] - g(A \setminus A^i) + g(A) \right\}.$$

We will check that this is nonnegative by checking that each summand (in the sum on $i$) is
nonnegative. So, we need to check that if $B \subset A$, then

$$\sum_{x \in B} g(A \setminus \{x\}) \ge g(A \setminus B) + (|B| - 1) g(A).$$

To deduce this from (3), write $B = \{u_1, \dots, u_k\}$. Then (3) implies that

$$g(A \setminus \{u_j\}) \ge g(A \setminus \{u_1, \dots, u_j\}) - g(A \setminus \{u_1, \dots, u_{j-1}\}) + g(A) \qquad (4)$$

for each $j$. Now sum (4) for $1 \le j \le k$ to complete the proof. □

Remark 5. By taking $C = S$ in Theorem 4, we get that for any graph $S$ and any $\lambda > 0$, if
the basic contact process dies out, then the contact process with mutations dies out for every
$r \in [0, 1]$.

Proof of Theorem 2. By Theorem 1.1, the contact process with mutations survives for $\lambda > \frac{1}{d-1}$.
Therefore, it remains only to verify (2) when $\lambda < \frac{1}{2\sqrt{d}}$. Recall that $x$ is the location of the
pathogen at time zero. Following Section 1.4 of Liggett (1999), define a function $\ell : T_d \to \mathbb{Z}$ such
that $\ell(x) = 0$ and, for each $y \in T_d$, we have $\ell(z) = \ell(y) - 1$ for exactly one neighbor $z$ of $y$,
while $\ell(z) = \ell(y) + 1$ for the other $d$ neighbors of $y$. We can think of $\ell(y)$ as being the generation
number of $y$, with $x$ belonging to generation zero.
For $0 < \rho < 1$, define

$$w_\rho(A) = \sum_{x \in A} \rho^{\ell(x)}.$$

For the basic contact process, we have

$$E[w_\rho(A'_t)] \le \exp[(\lambda d \rho + \lambda \rho^{-1} - 1)t]\, w_\rho(A'_0). \qquad (5)$$

This can be seen from the differential equation in (4.4) on p. 80 of Liggett (1999). Alternatively,
equation (4.6) on p. 81 of Liggett (1999) gives (5) with equality for the branching random walk,
so the inequality follows from comparing the contact process with the branching random walk.

Since initially there is just one pathogen at $x$, we have $w_\rho(A'_0) = 1$. Also, since (5) holds for
all $\rho$, we can optimize the bound by choosing $\rho = 1/\sqrt{d}$. If $\lambda < \frac{1}{2\sqrt{d}}$, then $\gamma = \lambda d \rho + \lambda \rho^{-1} - 1 < 0$.
Since $w_\rho(A'_t) \ge 1$ whenever $x \in A'_t$, it follows that

$$P(x \in A'_t) \le E[w_\rho(A'_t)] \le \exp(\gamma t).$$

Since $\gamma < 0$, we have

$$\int_0^\infty P(x \in A'_t)\, dt < \infty.$$

Using Theorem 4 with $A = C = \{x\}$ to compare the basic contact process and the contact process
with mutations, we get $P(x \in A_t) \le P(x \in A'_t)$ for all $t$. Thus,

$$E\left[\int_0^\infty 1_{\{x \in A_t\}}\, dt\right] = \int_0^\infty P(x \in A_t)\, dt < \infty. \qquad (6)$$

On the event that $x$ is infected infinitely many times, almost surely the Lebesgue measure of the
set of infected times is infinite. Therefore, (2) follows from (6). □
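
The choice $\rho = 1/\sqrt{d}$ in the proof above minimizes the exponent $\lambda d \rho + \lambda \rho^{-1} - 1$, whose minimum value is $2\lambda\sqrt{d} - 1$. A small numerical sketch of this optimization follows; the function names and the grid of test values are assumptions of the example.

```python
import math


def growth_exponent(lam, d, rho):
    """Exponent in bound (5): lam*d*rho + lam/rho - 1."""
    return lam * d * rho + lam / rho - 1.0


def best_rho(d):
    """Minimizer of the exponent over rho > 0 (AM-GM): rho = 1/sqrt(d)."""
    return 1.0 / math.sqrt(d)


if __name__ == "__main__":
    d, lam = 6, 0.202          # lam lies in (1/(d-1), 1/(2*sqrt(d))) for d = 6
    gamma = growth_exponent(lam, d, best_rho(d))
    print(gamma, 2.0 * lam * math.sqrt(d) - 1.0)   # both values agree and are negative
```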

Remark 6. Because $P(x \in A_t) \le P(x \in A'_t)$, we have

$$\lim_{t \to \infty} P(x \in A_t) = 0 \qquad (7)$$

whenever this result holds for the basic contact process. In particular, if $\lambda < \lambda_2$, where $\lambda_2$ is the
upper critical value for the contact process, that is, the infimum over the set of $\lambda$ for which the
contact process survives strongly, then (7) holds for all $r \in [0, 1]$. Pemantle (1992) showed that
$\lambda_2 > 0.309$ when $d = 5$ and $\lambda_2 > 0.354$ when $d = 4$. These bounds, combined with Theorem
1.1, show that when $d = 4$ or $d = 5$, there is a range of $\lambda$ such that for all $r \in [0, 1]$, the contact
process with mutations survives but (7) holds. For $d \ge 6$, Theorem 2 gives the stronger result of
weak survival for a range of $\lambda$.

Remark 7. Unlike the case of the contact process, many monotonicity statements for $A_t$ are
either false or hard to prove. To illustrate, we note that the following statement is false, even
when $S$ has only two elements:

"For all $\lambda$, $t$ and $r$, $P^A(A_t \neq \emptyset)$ is increasing in $A$ for sets $A$ of a single type."

To see this, let $S = \{x, y\}$, $f(A) = \lim_{\lambda \to \infty} P^A(A_t \neq \emptyset)$ for a fixed $t > 0$, and for simplicity take
$r = 1/2$. Then it is not hard to check that

$$f(\{x, y\}) = e^{-t}, \qquad f(\{x\}, \{y\}) = (1 + t) e^{-t}, \qquad f(\{x\}) = \left(1 + \tfrac{t}{2}\right) e^{-t}.$$

(Here $\{x, y\}$ is the configuration with two particles of the same type, $\{x\}, \{y\}$ is the configuration
with two particles of different types, and $\{x\}$ is the configuration in which $x$ is occupied and $y$ is
vacant.) Therefore,

$$f(\{x, y\}) < f(\{x\}) < f(\{x\}, \{y\}).$$

In particular, we do not know whether the survival probability $P^{\{x\}}(A_t \neq \emptyset\ \forall t)$ is monotone in
$r$ for fixed $\lambda$, even though Theorem 4 suggests that this is probably the case.
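
The two-site counterexample in Remark 7 can be checked numerically for large but finite $\lambda$. The sketch below is an illustration under stated assumptions rather than the authors' computation: by symmetry it tracks three non-empty states (one site occupied; both sites occupied by one type; both sites occupied by two types), writes down the backward equations for the survival probabilities, and integrates them with a hand-written Runge-Kutta step. The function name and step count are choices made here.

```python
import math


def survival_probs(lam, r, t, steps=4000):
    """Survival probabilities at time t on the two-site graph.

    Returns (u_one, u_same, u_diff) for the initial states: one occupied site,
    two sites of the same type, two sites of different types.
    """
    def deriv(u):
        u_one, u_same, u_diff = u
        return (
            # one site: dies at rate 1, fills the neighbor at rate lam
            -(1.0 + lam) * u_one + lam * (1.0 - r) * u_same + lam * r * u_diff,
            # same type on both sites: both die together at rate 1
            -u_same,
            # two types: one of them dies at total rate 2, leaving one site
            -2.0 * u_diff + 2.0 * u_one,
        )

    h = t / steps
    u = (1.0, 1.0, 1.0)
    for _ in range(steps):  # classical fourth-order Runge-Kutta
        k1 = deriv(u)
        k2 = deriv(tuple(a + 0.5 * h * b for a, b in zip(u, k1)))
        k3 = deriv(tuple(a + 0.5 * h * b for a, b in zip(u, k2)))
        k4 = deriv(tuple(a + h * b for a, b in zip(u, k3)))
        u = tuple(
            a + h * (b + 2.0 * c + 2.0 * d + e) / 6.0
            for a, b, c, d, e in zip(u, k1, k2, k3, k4)
        )
    return u


if __name__ == "__main__":
    t = 1.0
    print(survival_probs(200.0, 0.5, t))
    print((1 + t / 2) * math.exp(-t), math.exp(-t), (1 + t) * math.exp(-t))
```

For $\lambda = 200$ the numerical values are close to the limiting formulas and already show the ordering $f(\{x, y\}) < f(\{x\}) < f(\{x\}, \{y\})$.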

4 Proofs of Theorems 1.1 and 1.2

We start with the proof of Theorem 1.1. The proof requires some extra care because of the
monotonicity issues mentioned in Remark 7.

To prove this result, we first establish that our particle system dominates a particle system
that obeys the same rules except that only one birth can ever happen on any given site, and no
births can occur on sites $y$ for which $\ell(y) < 0$, where $\ell$ is the function defined in the proof of
Theorem 2. Note that this second restriction amounts to running the process on the rooted $d$-ary
tree with $x$ as the root, so that $x$ has $d$ neighbors and the other vertices have $d + 1$ neighbors.
Therefore, it will suffice to give a condition for this modified particle system to survive. 

To couple the two particle systems, we construct them from the same collection of Poisson
processes. For each nonzero integer $k$, let $(T_i^k)_{i=1}^\infty$ be the times of a Poisson process of rate 1 on
$[0, \infty)$. For each pair of adjacent vertices $v$ and $w$ in $T_d$, let $(T_i^{v,w})_{i=1}^\infty$ be the times of a Poisson
process of rate $\lambda$ on $[0, \infty)$ and let $(\xi_i^{v,w})_{i=1}^\infty$ be a sequence of i.i.d. Bernoulli random variables
that are one with probability $r$ and zero with probability $1 - r$. All of these processes are assumed
to be independent of one another.

To construct the two particle systems, we start with an individual of type one at $x$. At the
times $T_i^{v,w}$, if site $v$ is occupied and site $w$ is vacant, the individual at site $v$ gives birth at site $w$,
except that for the second particle system, this birth is suppressed if there was a previous birth
at site $w$ or if $\ell(w) < 0$. If $\xi_i^{v,w} = 0$, then the new individual born at site $w$ has the same type
as the individual at site $v$. If $\xi_i^{v,w} = 1$, then this individual has a new type. If this new type is
the $k$th type that appears simultaneously in both particle systems, then we label it type $k$. If it
is the $k$th type to appear in one particle system but not the other, then we label the type $-k$.
Therefore, the types indexed by positive integers are coupled, while those indexed by negative
integers are not. At the times $T_i^k$, all individuals of type $k$ (if there are any) die.

Let $A_t$ be the set of occupied sites in the first particle system at time $t$, and let $B_t$ be the set
of occupied sites in the second particle system. For nonzero integers $k$, let $A_t^k$ be the set of sites
at time $t$ inhabited by a type $k$ individual in the first process, and let $B_t^k$ be the set of sites at
time $t$ inhabited by a type $k$ individual in the second process. The following lemma is sufficient
to imply that the first process survives for all $t$ whenever the second process does.

Lemma 8. For all $t$, we have $B_t^k = \emptyset$ for all $k < 0$ and $B_t^k \subset A_t^k$ for all $k > 0$.

Proof. Fix any vertex $w \in T_d$. Let $v_0, v_1, \dots, v_s$ be the vertices on the unique path from $x$ to $w$,
with $v_0 = x$ and $v_s = w$. Suppose $w \in B_t$ for some $t$. This means that for $j = 1, \dots, s$, the virus
must have spread from $v_{j-1}$ to $v_j$ by time $t$. Therefore, $\ell(v_j) \ge 0$ for $j = 0, 1, \dots, s$.

If $T_1^1 < T_1^{v_0,v_1}$, then the individual at $v_0$ dies before the virus spreads to $v_1$. In this case, the
site $v_1$ can never become infected in the second process because, by the rules of the second process,
$v_0$ can never be reinfected. Alternatively, if $T_1^1 > T_1^{v_0,v_1}$, then the site $v_1$ becomes infected in both
particle systems at time $\tau_1 = T_1^{v_0,v_1}$, and by definition the site is assigned the same (positive)
type number in both particle systems. Therefore, if $v_1$ is ever infected in the second particle
system, then it is infected in both particle systems by particles of the same positive type at the
same time $\tau_1$.

Proceeding by induction, suppose it is true that if $v_j$ is ever infected in the second particle
system, then it is infected in both particle systems by a particle of the same type $m_j > 0$ at
the same time $\tau_j$. Then, if $\min\{T_i^{m_j} : T_i^{m_j} > \tau_j\} < \min\{T_i^{v_j,v_{j+1}} : T_i^{v_j,v_{j+1}} > \tau_j\}$, the individual
at $v_j$ dies before spreading the infection to $v_{j+1}$, and the site $v_{j+1}$ can never become infected in
the second particle system because $v_j$ can never be reinfected. Otherwise, the site $v_{j+1}$ becomes
infected in both particle systems at time $\tau_{j+1} = \min\{T_i^{v_j,v_{j+1}} : T_i^{v_j,v_{j+1}} > \tau_j\}$ and therefore gets
the same type number in both particle systems. Thus, if $v_{j+1}$ is ever infected in the second
particle system, then it is infected in both particle systems at the same time by particles of the
same positive type.

By induction, we conclude that for all $w \in T_d$, if $w$ is ever infected in the second particle
system, then it is infected in both particle systems at the same time by particles having the same
positive type. This implies $B_t^k = \emptyset$ for all $k < 0$. This also implies that the particle on site $w$ will
die at the same time in both particle systems. Since the site can never become reinfected in the
second particle system, the result $B_t^k \subset A_t^k$ for all $k$ follows. □

Lemma 9. Let $U$ be the tree such that 1 is the root of the tree and $\ell$ is a child of $k$ if the first
type $\ell$ individual in the second particle system has a type $k$ individual as its parent. Then $U$ is a
Galton-Watson tree.

Proof. To prove this lemma, we will modify the Poisson process construction of the second particle
system. Rather than working with the Poisson processes $(T_i^{v,w})_{i=1}^\infty$ and $(\xi_i^{v,w})_{i=1}^\infty$, we will define,
for each positive integer $k$, a Poisson process $(T_i^{k,v,w})_{i=1}^\infty$ of rate $\lambda$ on $[0, \infty)$ and a sequence
$(\xi_i^{k,v,w})_{i=1}^\infty$ of i.i.d. random variables that are one with probability $r$ and zero with probability
$1 - r$. All of the processes are again assumed to be independent. We construct the particle system
as before, except that when there is a type $k$ individual at the site $v$, the times at which this
individual attempts to give birth on the site $w$ are the times $T_i^{k,v,w}$, and the random variables
$\xi_i^{k,v,w}$ determine whether the new individual born has the same type or a new type. Note also
that it is sufficient to consider only the vertices $v$ such that all vertices $w$ on the path from $x$ to $v$
satisfy $\ell(w) \ge 0$. These vertices form a tree $T_d^+$, which we can think of as being rooted at $x$, in
which $x$ has $d$ neighbors and the other vertices have $d + 1$ neighbors.

For all positive integers $k$, let $X_k$ denote the number of children of the vertex labeled $k$ (if
there is one) in the tree $U$. Fix a vertex $u \in T_d^+$, and let $v_0, v_1, \dots, v_k$ be the vertices on the
unique path from $x$ to $u$. Then, a type 1 individual gives birth to a new type on $u$ if and only if,
for $j = 1, \dots, k - 1$, the type 1 individual on site $v_{j-1}$ gives birth to another type 1 individual on
site $v_j$, and then the type 1 individual on site $v_{k-1}$ gives birth to an individual of a new type on
site $v_k$, all before the type 1 individuals die. These events depend only on $T_1^1$ and the processes
$(T_i^{1,v,w})_{i=1}^\infty$ and $(\xi_i^{1,v,w})_{i=1}^\infty$, so there is a function $f$ such that $X_1 = f(T_1^1, (T_i^{1,v,w})_{i=1}^\infty, (\xi_i^{1,v,w})_{i=1}^\infty)$.

Now, suppose there is a type $k$ individual born at some time. Let $\tau_k$ be the time at which
the first type $k$ individual is born, and let $v_k$ be the vertex at which the first type $k$ individual is
born. Note that $\tau_k$ and $v_k$ depend only on the evolution of the first $k - 1$ types and therefore are
functions of $T_i^j$, $(T_i^{j,v,w})_{i=1}^\infty$, and $(\xi_i^{j,v,w})_{i=1}^\infty$ for $j = 1, \dots, k - 1$. Also, note that type $k$ individuals
and their children can only be born in the subtree rooted at $v_k$ (consisting of the vertices $w$ such
that the path from $w$ to $x$ includes $v_k$), which is empty until time $\tau_k$. This means that the type
$k$ individuals evolve in the subtree rooted at $v_k$ just as the type 1 individuals evolve in
$T_d^+$. More precisely, let $\phi$ be a graph isomorphism mapping $T_d^+$ to the subtree of $T_d^+$ rooted at
$v_k$ such that $\phi(x) = v_k$. Also, let $J_k = \min\{i : T_i^k > \tau_k\}$ and $J_{k,v,w} = \min\{i : T_i^{k,v,w} > \tau_k\}$. Then

$$X_k = f\left(T_{J_k}^k - \tau_k,\ \left(T_{i + J_{k,\phi(v),\phi(w)} - 1}^{k,\phi(v),\phi(w)} - \tau_k\right)_{i=1}^\infty,\ \left(\xi_{i + J_{k,\phi(v),\phi(w)} - 1}^{k,\phi(v),\phi(w)}\right)_{i=1}^\infty\right).$$

From this fact and the translation invariance of Poisson processes, it follows that the conditional
distribution of $X_k$ given $X_1, \dots, X_{k-1}$ is the same as the unconditional distribution of $X_1$. This
is enough to establish that $U$ is a Galton-Watson tree. □

Proof of Theorem 1.1. When $r = 1$, the result is a known fact about the basic contact process
(see, for example, Theorem 4.1 on p. 79 of Liggett (1999)). Assume now that $r < 1$. By
Lemma 8 it suffices to prove that the second process survives with positive probability whenever
$\lambda > 1/(d - 1)$. Note that the second process survives for all times if and only if the Galton-Watson
tree $U$ is infinite. Therefore, we need to show that this Galton-Watson process is supercritical
when $\lambda > 1/(d - 1)$.

The mean of the offspring distribution can be computed explicitly. We will take $k = 1$ and
count the expected number of type 1 individuals born, not counting the one alive at time zero.
Note that for all $j \ge 1$, there are $d^j$ vertices $x \in T_d^+$ such that $\ell(x) = j$. Also, if a type 1 individual
is born on a given site $x$, the probability that a type 1 individual is born on a child of this site (that
is, a neighboring vertex $y$ such that $\ell(y) = \ell(x) + 1$) is the probability that this birth, which
happens at rate $\lambda(1 - r)$, happens before either a mutant is born on this site, which happens at
rate $\lambda r$, or the type dies, which happens at rate 1. Therefore, this probability is $\lambda(1 - r)/(\lambda + 1)$.
It follows that the probability that a type 1 individual is born on a given vertex $x \in T_d^+$ with
$\ell(x) = j$ is $[\lambda(1 - r)/(\lambda + 1)]^j$. Therefore, the expected number of type 1 individuals born is

$$\sum_{j=1}^\infty \left(\frac{d\lambda(1 - r)}{\lambda + 1}\right)^j.$$

Since the expected number of new types born to type 1 individuals is $r/(1 - r)$ times the expected
number of type 1 individuals born, the mean of the offspring distribution of the tree $U$ is

$$\frac{r}{1 - r} \sum_{j=1}^\infty \left(\frac{d\lambda(1 - r)}{\lambda + 1}\right)^j,$$

which is infinite if $r < (d - 1)/d$ and $\lambda \ge 1/(d - 1 - dr)$. Otherwise, we have

$$\frac{r}{1 - r} \sum_{j=1}^\infty \left(\frac{d\lambda(1 - r)}{\lambda + 1}\right)^j = \frac{r}{1 - r} \left(\frac{d\lambda(1 - r)}{\lambda + 1 - d\lambda(1 - r)}\right) = \frac{d\lambda r}{1 - (d - 1)\lambda + d\lambda r},$$

which (when the mean of the offspring distribution is finite) is greater than one if and only if
$\lambda > 1/(d - 1)$. This completes the proof of Theorem 1.1. □
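
The offspring-mean computation in this proof is easy to check numerically. The sketch below evaluates the geometric series and the closed form, and confirms on a grid that the mean exceeds one exactly when $\lambda > 1/(d-1)$. The function names and the grid are assumptions of this example; $0 < r < 1$ is assumed throughout.

```python
def mean_offspring_lower(lam, r, d):
    """Mean number of new types per type in the restricted (second) process.

    Computed as r/(1-r) * sum_{j>=1} q^j with q = d*lam*(1-r)/(lam+1);
    returns infinity when the geometric series diverges (q >= 1).
    """
    q = d * lam * (1.0 - r) / (lam + 1.0)
    if q >= 1.0:
        return float("inf")
    return r / (1.0 - r) * q / (1.0 - q)


def mean_offspring_lower_closed(lam, r, d):
    """Closed form d*lam*r / (1 - (d-1)*lam + d*lam*r), valid when the mean is finite."""
    return d * lam * r / (1.0 - (d - 1) * lam + d * lam * r)


if __name__ == "__main__":
    for lam in (0.4, 0.6):
        print(lam, mean_offspring_lower(lam, 0.5, 3), mean_offspring_lower_closed(lam, 0.5, 3))
```

For $d = 3$ and $r = 1/2$ the threshold is $\lambda = 1/2$: the mean is $0.75$ at $\lambda = 0.4$ and about $1.286$ at $\lambda = 0.6$.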

We now turn to Theorem 1.2. 

Proof of Theorem 1.2. When r = 1, the result is a known fact about the contact process. By 
using a comparison between the contact process and the branching random walk, one can deduce 
this result from Theorem 4.8 in Section 1.4 of Liggett (1999). When r = 0, all pathogens have 
the same type and therefore die out after an exponential time. Assume now that $0 < r < 1$.

We first construct the original particle system from Poisson processes in a slightly different
way. For each positive integer $k$, let $(T_i^k)_{i=1}^\infty$ be the times of a rate one Poisson process on $[0, \infty)$.
For each pair of adjacent vertices $v$ and $w$ in $T_d$ and each positive integer $k$, let $(T_i^{k,v,w})_{i=1}^\infty$ be
a Poisson process of rate $\lambda$ on $[0, \infty)$. For each positive integer $k$ and each vertex $v$ in $T_d$, let
$(\xi_i^{k,v})_{i=1}^\infty$ be a sequence of i.i.d. random variables that are one with probability $r$ and zero with
probability $1 - r$. All type $k$ individuals die at the times $T_i^k$. If, at time $T_i^{k,v,w}$, there is a type $k$
individual at site $v$ and the site $w$ is vacant, then the individual at site $v$ gives birth onto the site
$w$ at this time. To determine the type of this new individual, we examine the random variable
$\xi_j^{k,w}$ if this is the $j$th time that a type $k$ individual has given birth onto $w$. If $\xi_j^{k,w} = 0$, then the
new individual also has type $k$. Otherwise, it has a new type. No types are assigned negative
numbers, and the $\ell$th type to appear will be called type $\ell$. Let $A_t^k$ be the set of sites having type
$k$ at time $t$, and let $A_t$ be the set of occupied sites at time $t$.

To make our comparison argument work, we will define for each positive integer k an additional 
process consisting only of type k individuals. We will define the new process from the same 
collection of Poisson processes as the original process. If the first type $k$ individual in the original
process is born at site $v_k$ at time $\tau_k$, then the new process will be empty until time $\tau_k$, at which
time a type $k$ individual appears at the site $v_k$. Thereafter, the new process evolves with the same
rules as the original process, except that individuals of types other than $k$ are killed instantly
after being born. Let $D_t^k$ be the set of occupied sites in this process at time $t$.

Let $\zeta_k = \min\{T_i^k : T_i^k > \tau_k\}$, which is the time at which the type $k$ individuals die in both
processes, so $A_t^k = D_t^k = \emptyset$ for $t > \zeta_k$. For all $w \in T_d$, let $g(w) = \min\{j : \xi_j^{k,w} = 0\}$. Note that for
both processes, there is a type $k$ individual born at site $w$ if and only if there are at least $g(w)$
times before $\zeta_k$ when a type $k$ individual gives birth on site $w$. Also, note that once a type $k$
individual is born on site $w$, there can be no further births on site $w$ until time $\zeta_k$, so there can
never be more than $g(w)$ times at which a type $k$ individual gives birth on site $w$. Let $h_{A,w}(t)$ be
the number of times in the original process that a type $k$ individual gives birth on the site $w$ by
time $t$, and let $h_{D,w}(t)$ be the number of times in the additional process that an individual gives
birth on the site $w$ by time $t$.

We claim that $h_{A,w}(t) \le h_{D,w}(t)$ for all $t \in [\tau_k, \zeta_k]$ and all $w \in T_d$, and that $A_t^k \subset D_t^k$ for all
$t \in [\tau_k, \zeta_k]$. Since we clearly have $h_{A,w}(t) \le h_{D,w}(t)$ and $A_t^k \subset D_t^k$ when $t = \tau_k$, and the type $k$
individuals die at the same time in both processes, we need only to show that these results do not
break down at any of the times $T_i^{k,v,w}$ when type $k$ births may take place. Suppose $t = T_i^{k,v,w}$,
and that we have $h_{A,w}(t-) \le h_{D,w}(t-)$ and $A_{t-}^k \subset D_{t-}^k$. If $v \notin A_{t-}^k$, then there can be no birth
in the original process at time $t$, and therefore $h_{A,w}(t) \le h_{D,w}(t)$ and $A_t^k \subset D_t^k$. Suppose instead
$v \in A_{t-}^k \subset D_{t-}^k$. Then there are four possibilities.

• If $w \in A_{t-}$ and $w \in D_{t-}^k$, then there is no birth at time $t$ in either process, so $h_{A,w}(t) \le
h_{D,w}(t)$ and $A_t^k \subset D_t^k$.

• If $w \in A_{t-}$ and $w \notin D_{t-}^k$, then $h_{A,w}(t) = h_{A,w}(t-)$ and $h_{D,w}(t) = h_{D,w}(t-) + 1$. Again it is
clear that $h_{A,w}(t) \le h_{D,w}(t)$ and $A_t^k \subset D_t^k$.

• If $w \notin A_{t-}$ and $w \notin D_{t-}^k$, then there is a birth at time $t$ in both processes. Therefore,
$h_{A,w}(t) = h_{A,w}(t-) + 1$ and $h_{D,w}(t) = h_{D,w}(t-) + 1$, so we have $h_{A,w}(t) \le h_{D,w}(t)$. If
the individual born in the original process has type $k$, then $h_{A,w}(t) = g(w)$, which implies
$h_{D,w}(t) = g(w)$ and therefore a type $k$ individual is born in both processes. Thus, we still
have $A_t^k \subset D_t^k$.

• If $w \notin A_{t-}$ and $w \in D_{t-}^k$, then it is clear that $A_t^k \subset D_t^k$. Also, the individual at site $w$
at time $t-$ in the additional process must have type $k$, as only type $k$ individuals appear
in this process. This means that $h_{D,w}(t-) = g(w)$. Since $h_{A,w}(t) \le g(w)$ for all $t$, the
inequality $h_{A,w}(t) \le h_{D,w}(t)$ holds also.

These observations imply the claim. 

Now, let $Y_k^w$ be the number of new types born to type $k$ individuals at the site $w$ in the original
process, and let $Z_k^w$ be the number of new types born to type $k$ individuals at the site $w$ in the
additional process. Note that $Y_k^w = \min\{g(w) - 1, h_{A,w}(\zeta_k)\}$ and $Z_k^w = \min\{g(w) - 1, h_{D,w}(\zeta_k)\}$.
Therefore, it follows from the claim that $Y_k^w \le Z_k^w$. Writing $Y_k$ and $Z_k$ for the sums of $Y_k^w$ and
$Z_k^w$ over all sites $w$, we get $Y_k \le Z_k$. Also, because $T_d$ is a transitive graph, the
conditional distribution of $Z_k$ given $Z_1, \dots, Z_{k-1}$ and given that a type $k$ individual appears at
some time is the same as the distribution of $Z_1$.

Using the original process, define the tree $V$ such that 1 is the root of the tree and $\ell$ is a
child of $k$ if the first type $\ell$ individual has a type $k$ individual as its parent. Note that the vertex
$k$ has $Y_k$ children in the tree $V$. Since $Y_k \le Z_k$ for all $k$, it follows that the distribution of
the total progeny of $V$ is stochastically dominated by the distribution of the total progeny in a
Galton-Watson tree whose offspring distribution is the same as the distribution of $Z_1$. Therefore,
if $E[Z_1] \le 1$, then $V$ is finite almost surely, which implies that the process dies out almost surely.

The computation for $E[Z_1]$ differs from the computation in the proof of Theorem 1.1 in two
ways. First, there are $(d + 1) d^{j-1}$ vertices at a distance $j$ from $x$. Second, because new types die
immediately, if there is a type 1 individual present on a given site, a type 1 individual will be born
on a given child with probability $\lambda(1 - r)/(\lambda(1 - r) + 1)$ because this birth only must happen before
the type dies. Therefore, the expected number of new types born to type 1 individuals in this model is

$$\frac{(d + 1) r}{d(1 - r)} \sum_{j=1}^\infty \left(\frac{d\lambda(1 - r)}{\lambda(1 - r) + 1}\right)^j.$$

This is infinite when $\lambda \ge 1/[(d - 1)(1 - r)]$. Otherwise, we have

$$\frac{(d + 1) r}{d(1 - r)} \sum_{j=1}^\infty \left(\frac{d\lambda(1 - r)}{\lambda(1 - r) + 1}\right)^j = \frac{(d + 1) r}{d(1 - r)} \left(\frac{d\lambda(1 - r)}{\lambda(1 - r) + 1 - d\lambda(1 - r)}\right) = \frac{(d + 1) r \lambda}{1 - (d - 1)(1 - r)\lambda},$$

which is less than or equal to one if and only if $\lambda \le 1/(d - 1 + 2r)$. □
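
The bound on $E[Z_1]$ can be checked in the same spirit. The following sketch, with function names chosen here for illustration and $0 < r < 1$ assumed, evaluates the series and its closed form and confirms on a grid that the value is at most one exactly when $\lambda \le 1/(d - 1 + 2r)$.

```python
def mean_offspring_upper(lam, r, d):
    """Expected number of new types per type in the dominating process.

    Computed as (d+1)*r/(d*(1-r)) * sum_{j>=1} q^j with
    q = d*lam*(1-r)/(lam*(1-r)+1); infinite when q >= 1.
    """
    q = d * lam * (1.0 - r) / (lam * (1.0 - r) + 1.0)
    if q >= 1.0:
        return float("inf")
    return (d + 1) * r / (d * (1.0 - r)) * q / (1.0 - q)


def mean_offspring_upper_closed(lam, r, d):
    """Closed form (d+1)*r*lam / (1 - (d-1)*(1-r)*lam), valid when finite."""
    return (d + 1) * r * lam / (1.0 - (d - 1) * (1.0 - r) * lam)


if __name__ == "__main__":
    d, r = 3, 0.5
    for lam in (0.3, 0.4):   # the extinction threshold is 1/(d-1+2r) = 1/3
        print(lam, mean_offspring_upper(lam, r, d), mean_offspring_upper_closed(lam, r, d))
```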

5 Proof of Theorem 1.3 

In this section, we adapt the proof of Theorem 2.2(ii) of Pemantle (1992) to prove that the process
sometimes survives even for $\lambda < 1/(d - 1)$. We begin with the following general result.

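Lemma 10. Let $X_t$ be a pure jump Markov process on a countable set $S$ with transition rates
$q(x,y)$, $x \ne y$, set $q(x) = \sum_{y: y \ne x} q(x,y)$, and let $f$ be a nonnegative function on $S$. Suppose that
there exist positive $\epsilon$ and $M$ so that

(i) $f(y) - f(x) \ge -M$ whenever $q(x,y) > 0$,

and

(ii) $\sum_{y: |f(y)-f(x)| \le M} q(x,y)[f(y) - f(x)] \ge \epsilon q(x)$ for all $x \in S$.

Then there is an $\alpha > 0$ so that $E^x \exp[-\alpha f(X_t)]$ is a decreasing function of $t$ for all $x \in S$. In
particular, if $f(x) > 0$, $\tau$ is the hitting time of $\{y : f(y) = 0\}$, and this set is absorbing, then
$P^x(\tau < \infty) < 1$.

Proof. Choose $\gamma > 0$ so that $e^u \le 1 + u + u^2$ for $|u| \le \gamma$, and write for $0 < \alpha \le \gamma/M$

$$\begin{aligned}
\frac{d}{dt} E^x \exp[-\alpha f(X_t)] \Big|_{t=0}
&= \sum_{y: y \ne x} q(x,y) \left[ e^{-\alpha f(y)} - e^{-\alpha f(x)} \right] \\
&\le e^{-\alpha f(x)} \sum_{y: |f(y)-f(x)| \le M} q(x,y) \left[ e^{\alpha[f(x)-f(y)]} - 1 \right] \\
&\le e^{-\alpha f(x)} \sum_{y: y \ne x,\, |f(y)-f(x)| \le M} q(x,y) \left[ \alpha[f(x) - f(y)] + \alpha^2 M^2 \right] \\
&\le e^{-\alpha f(x)} \left[ -\epsilon \alpha q(x) + \alpha^2 M^2 q(x) \right],
\end{aligned}$$

which is $\le 0$ provided that $\alpha M^2 \le \epsilon$. Here the first inequality drops the terms with
$f(y) - f(x) > M$, which are negative, and the last one uses (ii). Setting $\alpha = \min(\gamma/M, \epsilon/M^2)$ and using the Markov
property gives the monotonicity result. To check the final statement, use the fact that if
$P^x(\tau < \infty) = 1$, then $f(X_t)$ is eventually 0. In that case $E^x \exp[-\alpha f(X_t)] \to 1$ as $t \to \infty$, whereas the
monotonicity statement keeps it at most $\exp[-\alpha f(x)] < 1$, a contradiction. □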
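
The two constants in the proof can be illustrated with a short check. The sketch below tests the elementary inequality $e^u \le 1 + u + u^2$ on a grid of points in $[-\gamma, \gamma]$ and computes the resulting exponent $\alpha = \min(\gamma/M, \epsilon/M^2)$. The helper names, the grid resolution, and the sample values of $\gamma$, $\epsilon$, and $M$ are assumptions made only for illustration.

```python
import math


def exp_bound_holds(gamma, steps=10001):
    """Check e^u <= 1 + u + u^2 on an evenly spaced grid of [-gamma, gamma].

    Hypothetical helper; a grid check is evidence, not a proof.
    """
    for i in range(steps):
        u = -gamma + 2 * gamma * i / (steps - 1)
        if math.exp(u) > 1 + u + u * u + 1e-15:
            return False
    return True


def choose_alpha(gamma, eps, M):
    """Exponent used in the proof: alpha = min(gamma / M, eps / M^2)."""
    return min(gamma / M, eps / M ** 2)


# Sample values (assumptions): gamma = 1 works on the grid, gamma = 2 does not.
alpha = choose_alpha(gamma=1.0, eps=0.5, M=10.0)
```

With these sample values both requirements of the proof hold: $\alpha M \le \gamma$, so the exponential bound applies to every retained jump, and $\alpha M^2 \le \epsilon$, so the final bracket is nonpositive.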

Returning to the mutation process, for a configuration $A$, let $N(A)$ be the number of types
in $A$, and $C(A)$ be the number of components of $A$. We will apply the lemma with $f(A) =
\alpha N(A) + \beta C(A)$ for an appropriate choice of $\alpha > 0$ and $\beta > 0$. Note that $N(A)$ can only increase
by 1 or decrease by 1 at each transition. On the other hand, $C(A)$ can decrease by at most
$d$ (if a particle is born at a site with all neighbors occupied), but can increase by arbitrarily
large amounts (if a type that is connected to a large number of other types dies). So $f(A_t)$
has bounded downward jumps but unbounded upward jumps, unlike the situation Pemantle
considered. Nevertheless, by choosing $M$ sufficiently large, which we now do, we satisfy (i) of the
lemma.

Turning to assumption (ii), we consider first the contributions coming from changes in $N(A)$.
The number of types decreases by one at rate $N(A)$, since each type dies at rate 1. The number
of types increases by one at rate

$$\lambda r \, \#\{(x,y) : x \in A,\ y \notin A\} = \lambda r [(d-1)|A| + 2C(A)],$$

where the pairs $(x,y)$ range over neighboring sites. This is exactly the same as Pemantle's computation, except that there is an extra factor of $r$
coming from the fact that each new particle is of a new type with probability $r$. Combining these
observations leads to

$$\sum_{B} q(A,B)[N(B) - N(A)] = -N(A) + \lambda r [(d-1)|A| + 2C(A)].$$
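
The count of boundary pairs used above, $(d-1)|A| + 2C(A)$, can be checked by brute force on finite subsets of the tree. In the sketch below, a vertex is encoded as the tuple of child indices on the path from an arbitrarily chosen root; this encoding and all function names are illustrative assumptions, not notation from the paper.

```python
import random


def neighbors(v, d):
    """Neighbors of vertex v in the tree where every vertex has d + 1 neighbors.

    v is a tuple of child indices from the root (hypothetical encoding).
    """
    n_children = d + 1 if len(v) == 0 else d
    nbrs = [v + (i,) for i in range(n_children)]
    if v:
        nbrs.append(v[:-1])  # the parent
    return nbrs


def count_components(A, d):
    """Number of connected components of the finite vertex set A."""
    seen, comps = set(), 0
    for start in A:
        if start in seen:
            continue
        comps += 1
        stack = [start]
        seen.add(start)
        while stack:
            v = stack.pop()
            for w in neighbors(v, d):
                if w in A and w not in seen:
                    seen.add(w)
                    stack.append(w)
    return comps


def boundary_pairs(A, d):
    """Number of ordered neighbor pairs (x, y) with x in A and y not in A."""
    return sum(1 for x in A for y in neighbors(x, d) if y not in A)


def random_finite_set(d, size, rng):
    """Random finite set built from endpoints of short random walks from the root."""
    A = set()
    while len(A) < size:
        v = ()
        for _ in range(rng.randint(0, 6)):
            v = rng.choice(neighbors(v, d))
        A.add(v)
    return A
```

The identity holds because each component is a finite subtree: a component with $m$ sites has $m - 1$ internal edges, so $(d+1)m - 2(m-1) = (d-1)m + 2$ of its edges lead outside.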



The rate at which $C(A)$ decreases as the result of particle births is at most $(d+1)\lambda C(A)$,
as shown by Pemantle. The major difference between Pemantle's situation and ours comes next.
Consider one component $C$ of $A$ with $k \ge 2$ types, in which type $i$ is connected to $j_i$ other types.
As Pemantle showed,

$$\sum_{i=1}^{k} (j_i - 1) = k - 2. \tag{8}$$

If the $i$th type dies, $j_i - 1$ new components are formed. Thus we need to find a lower bound for

$$\sum_{i: j_i \le M+1} (j_i - 1).$$

In doing so, we will use (8), together with the simple inequalities

$$j_i \le (d-1)|A_i| + 2, \tag{9}$$

$$|C| = \sum_{i=1}^{k} |A_i| \ge \sum_{i: j_i > M+1} |A_i| + \sum_{i: j_i \le M+1} 1, \tag{10}$$

and (using (8) and the fact that $j_i \ge 1$)

$$k - 2 \ge \sum_{i: j_i > M+1} (j_i - 1) \ge (M+1) \sum_{i: j_i > M+1} 1. \tag{11}$$

Putting these together gives:

$$\begin{aligned}
\sum_{i: j_i \le M+1} (j_i - 1)
&= (k-2) - \sum_{i: j_i > M+1} (j_i - 1) \\
&\ge (k-2) - \sum_{i: j_i > M+1} \left[ (d-1)|A_i| + 1 \right] \\
&\ge (k-2) - \sum_{i: j_i > M+1} 1 - (d-1) \left[ |C| - \sum_{i: j_i \le M+1} 1 \right] \\
&= (k-2) - (d-1)|C| + (d-1)k - d \sum_{i: j_i > M+1} 1 \\
&\ge kd - 2 - (d-1)|C| - d\, \frac{k-2}{M+1} \\
&\ge kd\, \frac{M}{M+1} - (d-1)|C| - 2.
\end{aligned}$$

We have used (9) in the first inequality, (10) in the second, and (11) in the third. Note that
this inequality is trivially true in the case $k = 1$ in which the entire component is of one type.
Summing the above inequality over all components gives

$$\sum_{i=1}^{N(A)} (j_i - 1) 1_{\{j_i \le M+1\}} \ge N(A) d\, \frac{M}{M+1} - (d-1)|A| - 2C(A).$$

It follows that

$$\sum_{B: |C(B)-C(A)| \le M} q(A,B)[C(B) - C(A)] \ge N(A) d\, \frac{M}{M+1} - (d-1)|A| - 2C(A) - (d+1)\lambda C(A).$$
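
The per-component bound can be sanity-checked numerically from its ingredients alone. The sketch below draws a random tree on $k$ types (so that (8) holds), assigns each type a size $|A_i|$ compatible with (9), and tests the final inequality $\sum_{i: j_i \le M+1} (j_i - 1) \ge kd\,M/(M+1) - (d-1)|C| - 2$. It checks the algebra only and does not simulate the process; the way the random tree and the sizes are generated is an assumption made for illustration, and it requires $d \ge 2$.

```python
import random


def random_type_tree_degrees(k, rng):
    """Degrees j_i of a random tree on k vertices (each new vertex attaches to an earlier one)."""
    deg = [0] * k
    for i in range(1, k):
        parent = rng.randrange(i)
        deg[i] += 1
        deg[parent] += 1
    return deg


def min_size(j, d):
    """Smallest integer size |A_i| >= 1 with j <= (d - 1)|A_i| + 2, as in (9)."""
    return max(1, -(-(j - 2) // (d - 1)))  # ceiling division


def component_bound_holds(deg, sizes, d, M):
    """Test sum_{j_i <= M+1} (j_i - 1) >= k d M/(M+1) - (d-1)|C| - 2."""
    k = len(deg)
    lhs = sum(j - 1 for j in deg if j <= M + 1)
    rhs = k * d * M / (M + 1) - (d - 1) * sum(sizes) - 2
    return lhs >= rhs - 1e-9


# Example: a star with 10 types (hypothetical input), d = 2, M = 3.
star_deg = [9] + [1] * 9
star_sizes = [min_size(j, 2) for j in star_deg]
star_ok = component_bound_holds(star_deg, star_sizes, d=2, M=3)
```

The star example is the kind of configuration that makes the jumps of $C(A)$ unbounded: the death of the central type creates many components at once, and it is exactly such types that the indicator $1_{\{j_i \le M+1\}}$ excludes.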



Now take $f(A) = \alpha N(A) + \beta C(A)$. Note that when $M$ is sufficiently large, we can only
have $|f(B) - f(A)| > M$ with $q(A,B) > 0$ when the death of a single type leads to the creation
of many components. That is, $|f(B) - f(A)| > M$ if and only if $N(B) - N(A) = -1$ and
$C(B) - C(A) > (M + \alpha)/\beta$. It follows that for sufficiently large $M$,

$$\begin{aligned}
\sum_{B: |f(B)-f(A)| \le M} & q(A,B)[f(B) - f(A)] \\
&\ge \alpha \sum_{B} q(A,B)[N(B) - N(A)] + \beta \sum_{B: |C(B)-C(A)| \le M/\beta} q(A,B)[C(B) - C(A)] \\
&\ge \alpha \left[ -N(A) + \lambda r (d-1)|A| + 2\lambda r C(A) \right]
 + \beta \left[ N(A) d\, \frac{M}{M+\beta} - (d-1)|A| - 2C(A) - (d+1)\lambda C(A) \right] \\
&= N(A) \left[ -\alpha + \beta d\, \frac{M}{M+\beta} \right]
 + |A| \left[ \alpha \lambda r (d-1) - \beta (d-1) \right]
 + C(A) \left[ 2\lambda r \alpha - 2\beta - \beta (d+1) \lambda \right].
\end{aligned}$$



Since the total jump rate $q(A)$ is bounded above by a multiple of $|A|$, and $N(A) \le |A|$,
assumption (ii) in the lemma will be satisfied if, in the expression above, the coefficient of $|A|$ is
strictly positive, the sum of the coefficients of $N(A)$ and $|A|$ is strictly positive, and the coefficient
of $C(A)$ is nonnegative. This will be true for large $M$ if

$$\alpha \lambda r > \beta, \qquad \beta > \alpha [1 - \lambda r (d-1)], \qquad \text{and} \qquad 2\lambda r \alpha \ge \beta [2 + \lambda (d+1)].$$

There exist positive $\alpha$ and $\beta$ satisfying these inequalities if $\lambda r (d-1) \ge 1$, or if $\lambda r (d-1) < 1$ and

$$\frac{2 + \lambda (d+1)}{2 \lambda r} < \frac{1}{1 - \lambda r (d-1)},$$

i.e., if $\lambda$ is greater than the positive root of

$$\lambda^2 r (d^2 - 1) + \lambda (2rd - d - 1) - 2 = 0.$$
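
To see concretely how the three constraints on $\alpha$ and $\beta$ reduce to the sign of this quadratic, the following sketch fixes $\alpha = 1$ (the constraints are homogeneous in the pair) and looks for an admissible $\beta$. The function names and the choice of the midpoint of the admissible interval are illustrative assumptions.

```python
def quadratic(lam, r, d):
    """Left-hand side lam^2 r (d^2 - 1) + lam (2 r d - d - 1) - 2."""
    return lam ** 2 * r * (d * d - 1) + lam * (2 * r * d - d - 1) - 2


def feasible_beta(lam, r, d, alpha=1.0):
    """Return a beta > 0 satisfying the three constraints, or None if none exists.

    Constraints (in the limit of large M):
      alpha*lam*r > beta,
      beta > alpha*(1 - lam*r*(d-1)),
      2*lam*r*alpha >= beta*(2 + lam*(d+1)).
    """
    lower = max(alpha * (1 - lam * r * (d - 1)), 0.0)
    upper = min(alpha * lam * r, 2 * lam * r * alpha / (2 + lam * (d + 1)))
    if lower < upper:
        return (lower + upper) / 2  # midpoint of the admissible interval
    return None
```

For example, with $d = 2$ and $r = 0.5$ the quadratic is $1.5\lambda^2 - \lambda - 2$, whose positive root is about $1.535$; an admissible $\beta$ exists at $\lambda = 1.6$ but not at $\lambda = 1.5$.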
Applying the lemma, we now have the following result.

Theorem 11. The contact process with mutations survives if

$$\lambda > \frac{d + 1 - 2rd + \sqrt{(d+1)^2 + 4r(d+1)(d-2) + 4d^2 r^2}}{2r(d^2 - 1)}. \tag{12}$$

Combining this result with Theorem 1.2, we see that for

$$\frac{1 - d + \sqrt{(d-1)(7 + 9d)}}{2(d^2 - 1)} < \lambda < \frac{1}{d-1}$$

the process dies out for small $r$ and survives for $r$ close to 1. Note also that while Theorem 11
gives a better lower bound than Theorem 1.1 when $r$ is close to one, the lower bound in Theorem
1.1 is better for small $r$, as the expression on the right-hand side of (12) approaches infinity as
$r \to 0$.
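
The following sketch evaluates the bound of Theorem 11 and compares it with the extinction condition $\lambda \le 1/(d-1+2r)$ obtained in the proof of Theorem 1.2. The function names and the sample values $d = 2$ and $\lambda = 0.8$ are assumptions chosen only to illustrate the interval described above.

```python
import math


def theorem11_threshold(r, d):
    """Right-hand side of (12): survival holds for lam above this value."""
    disc = (d + 1) ** 2 + 4 * r * (d + 1) * (d - 2) + 4 * d * d * r * r
    return (d + 1 - 2 * r * d + math.sqrt(disc)) / (2 * r * (d * d - 1))


def extinction_threshold(r, d):
    """Extinction holds for lam at or below 1 / (d - 1 + 2r)."""
    return 1 / (d - 1 + 2 * r)


def quadratic(lam, r, d):
    """Quadratic whose positive root is the threshold in (12)."""
    return lam ** 2 * r * (d * d - 1) + lam * (2 * r * d - d - 1) - 2


# Sample values (assumptions): d = 2 and lam = 0.8, which lies in (2/3, 1).
d, lam = 2, 0.8
dies_out_small_r = lam <= extinction_threshold(0.01, d)
survives_r_one = lam > theorem11_threshold(1.0, d)
```

For $d = 2$ and $r = 1$ the threshold equals $(-1 + \sqrt{25})/6 = 2/3$, which agrees with the left endpoint $(1 - d + \sqrt{(d-1)(7+9d)})/(2(d^2-1))$ of the interval, while for $r = 0.01$ the same expression is close to 100.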